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Abstract. We prove that if Ft is a principal ideal ring and A S M n (i?) is a 
matrix with trace zero, then A is a commutator, that is, A = XY — YX for 
some X, Y £ M n (i?). This generalises the corresponding result over fields due 
to Albert and Muckenhoupt, as well as that over "L due to Laffey and Reams, 
and as a by-product we obtain new simplified proofs of these results. We also 
establish a normal form for similarity classes of matrices over PIDs, generalising 
a result of Laffey and Reams. This normal form is a main ingredient in the 
proof of the result on commutators. 



1. Introduction 

Let R denote an arbitrary ring. If a matrix A s M„(i?) is a commutator, that 
is, if A = [X,Y] = XY - YX for some X,Y e M n (R), then A must have trace 
zero. The problem of when the converse holds goes back at least to Shoda [T3] who 
showed in 1937 that if K is a field of characteristic zero, then every A g M n (K) 
with trace zero is a commutator. Shoda's argument fails in positive characteristic, 
but Albert and Muckenhoupt pQ found another argument valid for all fields. The 
first result for rings which are not fields was obtained by Lissner [8] who proved 
that if R is a principal ideal domain (PID) then every A € M2 (R) with trace zero 
is a commutator. A motivation for Lissner's work was the relation with a special 
case of Serre's problem on projective modules over polynomial rings, nowadays 
known as the Quillen-Suslin theorem (see [H Sections 1-2]). Lissner's result on 
commutators in M2 (R) for R a PID was rediscovered by Vaserstein [19] and Rosset 
and Rosset [13], respectively. Vaserstein also formulated the problem of whether 
every A G M„(Z) with trace zero is a commutator for n > 3 (see Q1J1 Section 5]). A 
significant breakthrough was made by Laffey and Reams f6 who settled Vaserstein's 
problem in the affirmative. However, their proofs involve steps which are special to 
the ring of integers Z and do not generalise to other rings in any straightforward 
way. The most crucial step of this kind is an appeal to Dirichlet's theorem on 
primes in arithmetic progressions. The analogue of Dirichlet's theorem, although 
true in the ring F g [x], fails for other Euclidean domains such as C[x] or discrete 
valuation rings. Nevertheless, in [5] Laffey asked whether any matrix with trace 
zero over a Euclidean domain is a commutator. Until now this appears to have 
remained an open problem even for n = 3, except for the cases where R is a field 
or Z. 

In the present paper we answer Laffey's question by proving that if R is any 
PID and A G M„(i?), is a matrix with trace zero, then A is a commutator. This 
is achieved by extending the methods of Laffey and Reams and in particular re- 
moving the need for Dirichlet's theorem. Another of our main results is a certain 
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(non-unique) normal form for similarity classes of matrices over PIDs, itself a gen- 
eralisation of a result proved in [6] over Z. The normal form, while interesting in 
its own right and potentially for other applications, is also a key ingredient in the 
proof of the main result on commutators. 

We now describe the contents of the paper in more detail. In Section[2]we define 
regular elements in M„(i?) for an arbitrary ring R and state some of their basic 
properties. Regular elements play a central role in the problem of writing matrices 
as commutators because of the criterion of Laffey and Reams, treated in Section [3] 
The criterion says that if R is a PID and A, X £ M„ (R) with X regular mod 
every maximal ideal of R, then a necessary and sufficient condition for A to be a 
commutator is that tx(X r A) = for r = 0, 1, . . . , n — 1. This was proved in |6J for 
R = Z, but the proof goes through for any PID with only a minor modification. 

In Section |4] we apply the Laffey- Reams criterion for fields to give a short proof 
of the theorem of Albert and Muckenhoupt mentioned above. We actually prove 
a stronger and apparently new result, namely that in the commutator one of the 
matrices may be taken to be regular (see Proposition |4]). 

Section [5] is concerned with similarity of matrices over PIDs, that is, matrices 
up to conjugation by invertible elements. Our first main result is Theorem 15.61 
stating that every non-scalar element in M„ (R) is similar to one in a special form. 
This result was established by Laffey and Reams over Z. However, a crucial step 
in their proof uses the fact that 2 is a prime element in Z, and the analogue of 
this does not hold in an arbitrary PID. To overcome this, our proof involves an 
argument based on the surjectivity of the map SL„(i?) — > SL„(i?/J) for an ideal /, 
which in a certain sense lets us avoid any finite set of primes, in particular those 
of index 2 in R (see Lemma |5.1[) . This argument is evident especially in the proof 
of Proposition 15.31 Apart from this, our proof uses the methods of [5], although 
we give a different argument, avoiding case by case considerations, and have made 
Lemma 15.21 explicit. 

Our second main result is Theorem 16.31 whose proof occupies Section [6l and 
follows the lines of Section 4]. There are two new key ideas in our proof. First, 
there is again an argument which at a certain step allows us to avoid finitely many 
primes, including those of index 2 in R. This step in the proof is the choice of q and 
uses a special case of Lemma l6.l|^[ Secondly, we apply Lemma ROl to obtain a set of 
generators of the centraliser of a certain matrix modulo a product of distinct primes; 
see (|6.7I) . It is this set of generators together with our choice of q and an appropriate 
choice of t in (|6.9p which allows us to avoid Dirichlet's theorem. It is interesting to 
note that the proofs of our main results, Theorems l5.6l and l6.3l despite being rather 
different, both involve the technique of avoiding finitely many primes, in particular 
those of index 2 in R. Our proof of Theorem 16.31 also simplifies parts of the proof 
of Laffey and Reams over Z since we avoid some of the case by case considerations 
present in the latter. By a theorem of Hungerford, Theorem 16.31 once established, 
easily extends to any principal ideal ring (not necessarily an integral domain); see 
Corollary [631 

The final Section [7] discusses the possibility of generalising Theorem 16.31 to other 
classes of rings such as Dedekind domains, and mentions some known counter- 
examples. 

We end this introduction by mentioning some recent work on matrix commu- 
tators. In [10] Mesyan proves that if R is a ring (not necessarily commutative) 
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and A G M n (R) has trace zero, then A is a sum of two commutators. This result 
was proved for commutative rings in earlier unpublished work of Rosset. In [4] 
Khurana and Lam study "generalised commutators", that is, elements of the form 
XYZ - ZYX, where X,Y,Z<E M n (R). They establish in particular that if R is a 
PID, then every element in M n (R), n > 2, is a generalised commutator. Although 
these results may seem closely related to the commutator problem studied in the 
present paper, the proofs are in fact very different. 

Notation and terminology. We use N to denote the natural numbers {1,2,...}. 
Throughout the paper a ring will always mean a commutative ring with identity. 
In Sections [3][6] R will be a PID, unless stated otherwise. 

Let R be a ring. We denote the set of maximal ideals of R by Specmi? and 
the ring of n x n matrices over R by M n (R). For A, B G M n (R) we call [A, B] = 
AB - BA the commutator of A and B. Let A eM n (R). A matrix B G M n (R) is 
said to be similar to A if there exists a g G GL„(i?) such that gAg^ 1 — B. The 
transpose of A is denoted by A T and the trace of A by tr(A). We write Cm„(_r)(^4) 
for the centraliser of A in M n (R), that is, 



Let f(x) = ao + a±x + ■ ■ - + x n G R[x] be the characteristic polynomial of A. We will 
refer to the companion matrix associated to A (or to /) as the matrix C G M„(i?) 
such that 



The identity matrix in M„(i?) is denoted by 1 or sometimes 1„. For u, v G N we 
write E uv for the matrix units, that is, E uv — (e^) with e uv = 1 and = 
otherwise. The size of the matrices E uv is suppressed in the notation and will be 
determined by the context. 



Let G be a reductive algebraic group over a field K with algebraic closure K. An 
element x G G = G(K) is called regular if dimCc^) is minimal, and it is known 
that this minimal dimension equals the rank rkG (see |16| and [2j Section 14]). 
Similarly, if q is the Lie algebra of G an element X G q{K) is called regular if 
dimCc(A) — rkG, where G acts on g via the adjoint action. In the case G = GL„ 
there are several equivalent characterisations of regular elements in g(K) = M n (K). 
More precisely, the following is well-known: 

Proposition 2.1. Let K be a field and X G M n (K). Then the following is equiv- 
alent 

i) X is regular, 

ii) There exists a vector v G K n such that {v,Xv, . . . , X n ~ 1 v} is a basis for 
K n over K, 

Hi) The set {1, X, ... , I"" 1 } is linearly independent over K, 
iv) X is similar to its companion matrix C as well as to C T , 



C Mn(R) (A) = {B G M n (R) | [A,B] - 0}. 




■i-i 



for 1 < i < n — 1, 
for 1 < i < n, 
otherwise. 



2. Regular elements 



v) 



C Mn[K) {X) =K[X 
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Regular elements of M n (K) are sometimes called non- derogatory or cyclic. For 
matrices over arbitrary rings we make the following definition. 

Definition 2.2. Let R be a ring. A matrix X G M n (i?) is called regular if there 
exists a vector v £ R n such that {v, Xv, . . . , X n ~ 1 v} is a basis for i?" over R. 

Proposition 2.3. Let R be a ring and X G M„(i?). Then the following is equivalent 
i) X is regular, 

ii) X is similar to its companion matrix C as well as to C T , 
iii) C Mn{R) (X) = R[X}. 

The proof of Proposition 12.31 is the same as in the classical case of matrices over 
fields. In the following we will use the properties of regular elements expressed in 
Propositions 12.11 and 12.31 without explicit reference. 

If if : R — > S is a homomorphism of rings we also use (p to denote the induced 
homomorphism M n (R) -> M n (S). 

Lemma 2.4. Let (p : R — > S be a homomorphism of rings. If X £ M ra (i?) is 
regular, then <p(X) is regular. 

Proof. Suppose that X is regular. By definition there exists a vector v G R n such 
that {v,Xv, . . .,X n - 1 v} is an i?-basis for R n . Then {v®l,Xv®l, . .^X^v®!} 
is an S*-basis for R n <S> R S (cf. [7, XVI, Proposition 2.3]). Let ip(v) G S n be the image 
of v under component-wise application of (p. Under the isomorphism R n ig>nS — > S n , 
the elements X l v® 1 are sent to <p(X) l ip(v), so {<p(v), <p(X)(p(v), . . . , p(X) n ~ 1 <p(v)} 
is a basis for S n . Thus tp(X) is regular. □ 

Let R be a ring and X G M„ (R) . If p is an ideal of R we use X p to denote the 
image of X under the canonical map it : M„(i?) — > M n (R/p), that is, X p = tt(X). 
For a general ring R an element in M„ (R) which is regular modulo every maximal 
ideal may not be regular. However, if R is a local ring, the situation is favourable: 

Lemma 2.5. Assume that R is a local ring with maximal ideal m. Then X G 
M. n (R) is regular if and only if X m G M„(i?/m) is regular. 

Proof. If X is regular, then X m is regular by Lemma 12.41 Conversely, suppose 
that X m is regular and choose v G (R/m) n such that (R/m) n — (R/m)[X m ]v. Let 
v G R n be a lift of v. Then R n = R[X]v + mM for some submodule M of R n , and 
Nakayama's lemma yields R n — R[X]v , so X is regular. □ 

Proposition 2.6. Let R be an integral domain with field of fractions F, and let 
X G M n (R). If X m is regular for some maximal ideal m of R, then X is regular as 
an element o/M„(f). 

Proof. Suppose that X m is regular for some maximal ideal m of R. Let R m be the 
localisation of R at m, and let j : R — > R m be the canonical homomorphism. Since 
the diagram 

R ^ R m 

R/m — -R m /m 

commutes, Lemma [2.51 implies that j(X) is regular. If Y^i=o r iX l = for some 
r; G R, then J27=o i( r «)i(^) 1 = 0- But since j(X) is regular, we must have 
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j( r i) — f° r all i = 0, ... ,71 — 1. Since R is an integral domain j is injective, so 



Ti = for i = 0, . . . , n — 1. Now, if J2"=o s iX l = for some Sj G F, then clearing 



denominators shows that Si — for all i = 0, . . . , n—1. Hence, by Proposition ^. 1 Hi) 



the matrix X is regular as an element of M„ (F) . □ 

The following result has appeared in [20, Proposition 6]. 

Lemma 2.7. Let R be an arbitrary ring and A = (a y -) G M„(i?) a matrix such 
that a i i+ i — 1 for all 1 < i < n and = for all j > i + 2. Then A is regular. 

Proof. Let {ei = (1, 0, . . . , 0) T , e 2 = (0, 1, 0, . . . , 0) T , . . . , e„ = (0, . . . , 0, 1) T } be the 
standard basis for R" . Then the matrix 

B = (ei,Aei,...,A n - 1 e 1 ) 

is upper triangular with Is on the diagonal, so B G SL„(i?). Now for 1 < i < n— 1 
we have 

B~ l ABei = B^A^x = e l+1 
(since -Be^+i = A l ei ). Thus B _1 AB is a companion matrix, and so A is regular. □ 

3. The criterion of Laffey and Reams 

Throughout this section R is a PID and F its field of fractions. In Theorem 13.31 
we give a criterion for a matrix in M„(i?) to be a commutator discovered by Laffey 
and Reams (BJ Section 3]. This criterion plays an important role in our proof of 
the main theorem. Laffey and Reams proved the criterion for matrices over fields 
and over Z, and we only need minor modifications of their proofs, together with 
Proposition 12.61 to prove it over arbitrary PIDs. 

The following result is from [6l Section 3]. We reproduce the proof here for 
completeness. 

Proposition 3.1. Let K be a field and X G M n (K) be regular. Let A G M n (K). 
Then A = [X, Y] for some Y G M n (K) if and only if tr(X r A) = for all r = 
0,...,n-l. 

Proof. Since {1, X, ... , X™^ 1 } is linearly independent over K the subspace 

V = {A g M n (K) | tx(X r A) = for 0,1,..., n - 1} 

has dimension n 2 — n. The kernel of the linear map M n (R) — » M n (R), Y H> [X, Y] 
is equal to the centraliser CM Tl (if)(^), which has dimension n since X is regular. 
Thus the image [X, M n (K)] of the map Y i-> [X, Y] has dimension n 2 — n. But if 
A G [X,M n (K)] there exists a Y G M n (K) such that for every r = 0, 1, . . . , n — 1 
we have 

tr(X r ^l) = tr(X r (Xr - YX)) = tr(X r+1 Y) - tr{X r YX) = 0. 

Thus A G V and so LY,M„(if)] C V. Since dimV = dim[X, M n (K)] we conclude 
that V= [X,M n (K)]. □ 

Proposition 3.2. Lei X G M„(i?) 6e such that X v is regular for every maximal 
ideal p in R. Suppose that M G M„(F) is such that [X,M] G M n (R). Then there 
exists anY G M„(i?) suc/i ttai [X, M] = [X, Y]. 
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Proof. There exists an element m G R such that mY G M n (R), and we have 
[X, mY] = m[X,Y]. Assume that d G R is chosen so that it has the minimal 
number of irreducible factors with respect to the property that [X, C] = d[X, Y] for 
some C £ Mn(ii). If d is a unit we are done, so assume thatp is an irreducible factor 
of d. Then [X, C] G p M„ (R), so commutes with Ct p ) ■ But since X^ is regular, 
we have Ci v \ — f(Xi p \), for some polynomial f(T) G -Rp" 1 ]- Hence C — /(X) = pi) 
for some D G M„(i?). But this implies that [X, C] = [X,pD] = p[X,D] and thus 
(dp _1 )[X, Y] — [X, D], giving a contradiction to our choice of d. Hence d is a unit 
and so [X, Y] = [X, M] with M = d~ 1 C G M n (R). □ 

Proposition 3.3. Let A G M n (R) and let X G M n (R) be such that X p is regular 
for every maximal ideal p in R. Then A — [X. Y] for some Y G M„(i?) if and only 
if tr(X r A) = for r = 0,...,n- 1. 

Proof. Clearly the condition tr(X r A) = for all r > is necessary for A to be 
of the form [X,Y] with Y G M n (R). Conversely, suppose that tr(X r A) = for 
r = 0, 1, . . . , n — 1. By Proposition 12.61 X is regular as an element in M n (F) so 
Proposition 13.11 implies that A = [X, M] for some M G M n (F). But now the result 
follows from Proposition 13.21 □ 



4. Commutators over fields 

Let if be a field. Using the criterion of Laffey and Reams over fields (Proposi- 
tion 13. ip we give a swift proof of the theorem of Albert and Muckenhoupt [1] that 
every matrix with trace zero in M n (K) is a commutator. 

Note that if R is any ring and A,X,Y G M n (R) are such that A = [X, Y], then 
for every g G GL n (R) we have gAg~ x — [gXg" 1 , gYg^ 1 ]. Thus A is a commutator 
if and only if any matrix similar to A is. 

Let n G N with n > 2 and k — [n/2\. The following matrices were considered by 
Laffey and Reams [6j Section 4] who also established the properties stated below. 

( Pti = l far* = 2,4,...,2fc, 
P n = (Pij) = \ Pia-2 = 1 for i = 3, 4, . . .n, 
I pjj = otherwise. 

Depending on the context we will consider P n as an element of M„ (R) where R is 
a ring. For any m G N and a G R we will use J m (a) to denote the m x m Jordan 
block with eigenvalue a and Is on the subdiagonal. Over any R the matrix P n is 
similar to Jfe(l) © J„_fc(0) (cf. [6, p. 681]), and thus it is regular by Lemma l2~7l 

For any A = (a^) G M n (R) let c(A) = J2i=i a 2^2i and = 
Suppose now that R is a PID and that ay = for j > i + 2. Observe that for any 
r G N, P^ has the same diagonal as P and the entry of is if i ^ j and 
i < j + 2. Thus 

(4.1) ti(P r A) = c(A), for r G N. 

Proposition 4.1. Lei K be a field and let A G M n (if) fee a matrix with trace zero. 
Then A — [X, Y] for some X, Y G M„ (if) , where X is regular. More precisely, if 
A is non-scalar X can be chosen to be conjugate to P n , while if A is scalar we can 
take X = J„(0). 
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Proof. Assume first that A is non-scalar. It then follows from the rational normal 
form that A is similar to a matrix B — (bij) with 612 = 1 and bij = for j > i + 2, 
so we have A = gBg~ l for some g G GL„(i4T). Define z G SL n (K) as 

z = l + c{B)E 21 . 

Then the (i, j) entry of z~ x Bz is for j > i + 2 and c(z _1 Bz) = 0, so by (|4~Tj) 
we have tr(P^z _1 _Bz) = for r = 0, . . . , n — 1. By Proposition 13.11 it follows that 
B = \zP n z- x ,Y\ for some Y G M n (iT), and thus A = [ffzPnCff^) -1 ^^ -1 ]. 

Assume on the other hand that A is a scalar. Then tr( J„(0) r A) = for r = 
0, . . . ,n — 1, and Proposition 13. II implies that A = [J„(0), Y~], for some Y G M ra (JsT). 

□ 

5. Matrix similarity over a PID 

In this section we extend the results of [BJ Section 2] on similarity of matrices 
over Z to matrices over an arbitrary PID R. 

Lemma 5.1. Let A G M n (i2) 6e non-scalar, and let S be a finite set of maximal 
ideals of R such that A p G M„(i?/p) is non-scalar for every p G S*. Then A is 
similar to a matrix B = (bij) G M n (R) such that &12 ^ p for all p G S. 

Proof. It is well known that for any PID R and any non-zero ideal a of R the 
natural map 

(5.1) SL„(i?) — ► SL„(i?/a) 

is surjective. This follows for example from the fact that R/a is the product of local 
rings and that over local rings SL„ is generated by elementary matrices (see |121 
2.2.2 and 2.2.6]). Moreover, if we take a = IlpesP the Chinese remainder theorem 
implies that we have an isomorphism 

(5.2) SL„(i?/a) =^ J] S M-R/P)- 

pes 

Let p G S. Since A p is non-scalar and R/p is a field the rational canonical form for 
matrices in M ra (i?/p) implies that there exists a g p G GL„(i?/p) such that gpApgp 1 
is a matrix whose (1,2) entry is non-zero. Since GL„(i?/p) = T(R/p) SL n (R/p), 
where T(R/p) is the diagonal subgroup of GL n (R/p), we may take g v to be in 
SL n (i?/p). Suppose that g p is chosen in this way for every p G S. By the surjectivity 
of the maps (15.11) and (15.21) . there exists a g G SL n (R) such that the image of g in 
SL n (i?/p) is <7 p for all p G S. Let B — (bij) — gAg^ 1 . Then B is a matrix such 
that 612 is non-zero modulo every p G S. □ 

The following lemma will be used repeatedly in the proof of Proposition 15 . 31 and 
Theorem l5.6l It can informally be described as saying that if the off-diagonal entries 
in a row (column) of a matrix A G M„(i?) with n > 3 have a greatest common 
divisor d, then A is similar to a matrix in which the corresponding row (column) 
has off-diagonal entries d, 0, . . . , 0. 

Lemma 5.2. Let A = (a^) G M n (R), n > 3. Let 1 < u < n and 1 < v < n 
be fixed. Let r G R be a generator of the ideal (a u j \ 1 < j < n, u ^ j), and let 
c G R be a generator of the ideal (a^ | 1 < i < n, i ^ v). Then A is similar to a 
matrix B — (bij) such that if u = 1 we have b U 2 = T and b u j — for all 3 < j < n, 
and if u > 2 we have b u \ = r and b U j = for all 1 < j < n such that j ^ {1, u}. 



SIMILARITY AND COMMUTATORS OF MATRICES OVER PRINCIPAL IDEAL RINGS 8 



Moreover, A is similar to a matrix C = (cy ) such that if v = 1 we have c 2v — r 
and Ci v — for all 3 < i < n, and if v > 2 we have c\ v = c and a v — for all 
1 < i < n such that i ^ {1, v}. 

Proof. The proof follows the lines of [TTJ Ch. Ill, Section 2]. For 1 < i < j < n and 

(£«)eSL 2 (i?),let 

Mij = Mij(x,y,z,w) 

= 1„ + (x- 1)E U + yEij + zE 0l + (w- 1)E ]0 e SL n (R). 

Note that = Mij(w, —y, —z,x). Let 3 < j < n. Direct computation shows 

that the first row in B\ := AM 2 j is 

(an, a\2X + a i3 z, a\ 2 y + a 13 w, a M , . . . , ai„) if j = 3, 

(au,ai 2 x + aijZ, ai 3 , . . . , ai^-i,ai 2 y + ayw, ai J+ i, . . . ,ai„) if j > 3. 

Now let 3 < j < n be the smallest integer such that ay ^ (if no such j exists the 
assertion of the lemma holds trivially for A and u = 1). Let d G R be a generator 
of (ai2,a]j) and set 

y = aijd^ 1 , w = — ai2<i _1 . 
Then {y,w) = (1) and hence x,z £ i? may be determined so that xiu — = 1. 
Thus Oi22; + fflijZ = — d. With these values of x,y,z,w all the entries of Ai in 
positions (1, 3), . . . , (1, j) are zero, and the (1, 2) entry generates the ideal (ai2, aij)- 
Repeating the process, let j < k < n be the smallest integer such that aik ^ 0. 
Then B 2 '■= B\M 2 k has all its entries (1, 3), . . . , (1, k) zero and its (1, 2) entry 
generates the ideal {a\2, aij, aik)- Proceeding in this way, we obtain a matrix 
B = (bij) similar to A such that bi 2 is a generator of (ay 2 < j < n) and 
bij = for 3 < j < n (the generator b\ 2 can be replaced by any other generator 
of (aij 2 < j < n) by a diagonal similarity transformation of B). This shows the 

existence of B for u = 1. For u > 2, observe that if we let W u — (w\j) e GL n (R) 

be any permutation matrix such that w$ — w u \ — 1, then 

A' = (4) = WuAW^ 1 

is a matrix such that a' n = a uu and {a'y | 2 < j < n} = {a U j \ 1 < j ' < n, u ^ j}. 
Informally, the off-diagonal entries in the u-th row of A are the same as the off- 
diagonal entries in the first row of A' , up to a permutation. Thus the existence of 
B for u > 2 follows from the argument for u = 1 above. 

For the existence of C for v = 1, let 3 < i < n and C\ M^AM^. Direct 
computation shows that the first column in Ci is 

(an, a 21 x + a 31 y, a 2 \z + a 3 iw, an, . . . , a nl ) T if i = 3, 

(an,a 21 x + a a y, a 31 , . . . , a^ 1A ,a 21 z + a a w, a i+lyl , . . . , a nl ) T if i > 3. 

Now let 3 < i < n be the smallest integer such that an ^ (if no such i exists the 
assertion of the lemma holds trivially for A and v = 1). Let e S i? be a generator 
of (a2i,Oii) and set 

z = a^ie -1 , w = — a2i<i -1 . 

Then (z,ui) = (1) and hence x,y £ i? may be determined so that xiu — = 1. 
Thus a 2 \x + any — —e. With these values of x,y,z,w all the entries of C\ in 
positions (3, 1), . . . , (i, 1) are zero, and the (2, 1) entry generates the ideal (021, an). 
Repeating the process in analogy with the above argument, we obtain a matrix C 
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satisfying the assertion of the lemma for v = 1. For v > 2 we may use the matrix 
W v as above to reduce to the case where v = 1. □ 

Proposition 5.3. Let A G M 3 (R) be non-scalar. Then A is similar to a matrix 
B = (^j) £ Ma(iZ) smc/i t/ia£ 612 | 6y /or aZZ i ^ j and 612 | (6.^ — 6jj) /or aZZ 

i<*,i<3. 

Proof. Write A = al + bA 1 , where a,6 e fl, 6 ^ and where, if A' = (aL-), we 
have (a^ — a'^.a'^ \ i ^ j,l < i,j < 3) = (1). Note that A p is non-scalar for every 
maximal ideal p of R and that the proposition will follow for A if we can show it 
for A' , that is, if we can show that A' is similar to a matrix whose (1, 2) entry is 
a unit. Without loss of generality we may therefore assume that A = A' so that A 
satisfies 

[an - " I i 5^ i, 1 < i,j < 3) = (1). 

Note that any matrix similar to A will also satisfy this. Since A v is not scalar for 
any maximal ideal p of R, Lemma 15.11 with 



S := {p £ Spccmi? | \R/p\ = 2} 

implies that A is similar to a matrix B — (bij) such that &12 ^ p for all p G S. Among 
all such matrices choose one for which the number of distinct primes which divide 
612 is least possible, and subject to this, for which the number of not necessarily 
distinct prime factors is minimal. By Lemma 15.21 applied to the first row in B, we 
see that there exists a matrix B' similar to B whose (1,3) entry is zero and whose 
(1,2) entry, being equal to (012,613), has no more distinct prime factors than b\2- 
Hence we may assume that B has been replaced by B' so that 613 = 0. We thus 
have the following condition on B: 

The matrix B = (bij) is similar to A, 612 ^ p for all p € S, 613 = 0, the entry 

, , 612 has the smallest number of distinct prime factors among all the matrices 
( ^ ) 

similar to A and among all matrices with these properties B is such that 612 
has the minimal number of not necessarily distinct prime factors. 

Note first that by Lemma 15.21 applied to the second column in B, there exists a 
matrix similar to B whose (1,2) entry is a generator of (612,632). Thus, by (*) we 
must have 612 | 632, so 632 = 612a for some a G R. Let 

B 1 = (&g>) = (1 - E 31 a)B{l - E^a)- 1 . 
Then b$ = b 12 and b$ = b$ = so that 



£1 = 



In particular, B' satisfies (*). 

Claim 5.4. The entry 612 divides both bi^ — b$ and b^ 1 . 

Let y G R. The first row of the matrix (1 + Ei 3 y)Bi(l + Ei 3 y)~ 
(b$+yb$,b 12 ,y(bV-b$- y b$)). 
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Thus, by (*) and Lemma [5721 applied to the first row in (1 + Ei 3 y)B\(l + E\ 3 y)~ 



we conclude that b\ 2 divides y(b y ^ — b$ — yb^\') for any y £ R. Let 

(6i 2 )=pr-p5- 

be the factorisation of (612), where v £ N, e 2 ; £ N and the ideals pi £ Specmi? are 
distinct for 1 < i < v. By (*) we know that i?/pi| > 3, so there exist elements 
2/1,2/2 6 R/pi sucn that 

yiT^O, y 2 ^0, Vit l V2- 
Let 2/1,2/2 £ be lifts of y 1 and y 2 , respectively, that is, is the image of th- 

under the canonical map -R/p^ 1 — > R/pi, for j = 1, 2. Since -R/pi 1 is a local ring, 
2/1, t/2 and ?/i — 2/2 are units. By the Chinese remainder theorem we have 



R/(h 



Let Xj = (j/j, 1, . . . , 1) £ Yli—i R/pT 1 f° r 3 = 1' 2. Then Ai and A2 can be considered 
as elements in R/{b\ 2 ) and as such they are units with the property that Ai — X 2 
is also a unit. Thus Ai, X 2 and Ai — A2 are all coprime to 612. We know from the 
above that 612 divides 2/(^33 — &11 — 2/&31 ) f° r any y £ R. In particular, choosing 

A 2 ) G (612), hence 6^ 6 (612) 



Ai, A2, Ai — A2, respectively, we obtain & 3 ^(Ai 
£ (bi 2 )- This proves the claim, 
there exist elements a, j3 £ R such that 



and b£) - b {1) 



"33 "11 

By Claim 



J 33 



ab\ 2 and 6 



(i) _ 

31 — 



db 



12- 



Let 

B 2 
Then & 



($>) = 

(2) _ i(2) 



12 



'32 



(1 + S 21 (-q + /3))(1 + B 3 i)5i(l + E 31 y\l + £ 21 (-Q + /3))" 
so that 



612 and 



J 2 ) 



B 2 



31 

/^l 

(2) 
21 

V 



612 

& (2) 

bi2 



Moreover, let 



B' 2 = (1 - E 31 )B 2 (1 - E, 



31 J 





bg\ 



6g? 

b$+b$ 

^°33 ~~ °11 



"12 
'22 





and 



(1 - E 33 )B 2 (l - E; 



33 J 




We will now show that B 2 has the property that 612 | b\j for all i ^ j and 
&12 I {bff — bjj) for all 1 < i,j < 3. This follows from the following fact applied to 
the matrices B' 2 and B 2 '- 

Claim 5.5. Suppose that C = (c^) £ M n (R) satisfies (*) and that c 32 = 0. Then 



Cij for all i,j such that (2,1) and i ^ j, and C12 



C12 

l<*,j<3 



) for all 
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To prove the claim, let x E R and 

X = ( Xij ) = (1 + E 32 x)C(l + E 32 x)-\ 

Then 

X12 = C12, 

•^32 = x(c 2 2 - C33 - XC 23 ), 

and by Lemma 0x21 applied to the second column in X we conclude that c\2 divides 
^(c22 — C33 — XC23) for any x E R. By (*) and the same argument as in the proof 
of Claim we obtain 

C12 I (c 2 2 ~ c 33 ) and C12 I c 23 . 

Next, for y E R let 

Y = (Vij) = (1 + ^13J/)C(1 + E 13 yr\ 

Then 

J/12 = C12, 

J/13 = 2/(c33 - Ch - 2/C31), 

and by Lemma 15.21 applied to first row in Y and the same argument as for the 
matrix X (that is, using (*) and the same argument as in the proof of Claim [5T^|) 
we obtain 

C12 | (c 3 3 - cu) and c i2 | c 31 , 
whence also C\>z \ (c 22 — Cn). This proves Claim [5~51 for C. 

Applying Claim l5~S"l to the matrices B' 2 and B 2 , respectively, we conclude that B 2 

(2.) (2) (2) 

has the property that bi 2 | for all i ^ j and bi 2 | (b^' — ) for all 1 < i,j < 3. 
Since i3 2 is similar to _B (and _B is similar to A), we have 

(6« - bjj^ij I i ^ j, 1 < i,j < 3) = (1), 

so &12 must be a unit. This proves the proposition. □ 

We now use Proposition 15.31 to prove the corresponding result for matrices in 
M n (R) for all n > 3. More precisely, we have 

Theorem 5.6. Let A E M. n (R) with n > 3, be non-scalar. Then A is similar to 
a matrix B = (6y) E M„(i?) such that b\ 2 \ bij for all i =/= j and b\2 \ (pa — bjj) 
for all 1 < i,j ' < n. Moreover, B may be chosen with bij — for all i,j such that 
j > i + 2 and 1 < i < n — 2 . 

Proof. As in the proof of Proposition 15.31 we may assume that 

(a« - ajj,aij \ i ^ j, 1 < i, j < n) = (1), 

and choose a matrix B satisfying the following condition 

The matrix B = (bij) is similar to A, (612,2) = (1), b%j — for j > 3, the 
, . entry b± 2 has the smallest number of distinct prime factors among all the 
matrices similar to A and among all matrices with these properties B is such 
that 612 has the minimal number of not necessarily distinct prime factors. 

If for some i,j the entry 612 does not divide bn — bjj, then b\ 2 does not divide 

611 — b vv for some v. If v > 4 let W v = (wff) E GL„(_R) be any permutation 

matrix such that = = 1, = 1 and w^J = 1. Then W V BW~ X has 
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(1,2) entry equal to 612 and (3,3) entry equal to b vv , so we may assume that 612 
does not divide bu — 622 or bu — 633. Consider the submatrix 

of B and note that any similarity Bq i-> g Bog for g £ GLs(i?) may be achieved 
by B i— > (g © I n _ 3 )B(g © J n _3) . By the minimality property of 612 expressed in 
(*) and the argument in the proof of Proposition 15.31 applied to -Bo we conclude 
that 612 divides both bu — 622 and bu — 633, which is a contradiction. Thus 

b\2 I (bu — bjj) for all 1 < i,j < n and b\2 \ bij for all j, 1 <i,j < 3. 

Similarly, for any 4 < v < n the matrix W V BW~ Y has (3, 1) entry equal to b v i, so 
by (*) and the argument in the proof of Proposition 15.31 applied to Bo we conclude 
that 612 I foui. Hence 

612 I foul for all 4 < v < n. 
Furthermore, by (*) and Lemma |5 . 2 1 applied to the second column in B, we see that 

bi2 I b l2 for all i ^ 2. 

Let 1 < u, v < n be such that u > 3 and o/ii. For x £ R let 

X u = (4 u) ) = (1 + K 2 )^(l + K2)" 1 , 

so that a;^ — fou2 — b vu and in particular = b\2- By (*) and Lemma l5.2l applied 
to the second column in X u we see that 612 | and since 612 | fo«2 we conclude 
that 612 I b vu . Hence 

^12 I b vu for all u > 3, v ^ u. 
We have thus shown that B has the property that 612 | bij for all i ^ j and 
612 I {bu - bjj) for all 1 < i,j < n. 

For the second statement we follow |11[ III, 2]. Conjugating B by I2 © M^j £ 
GL n (R) for a suitable M^j £ GL„_2(i?) (cf. the proof of Lemma l5.2[) . we can 
replace B by a matrix B\ in which the first row equals that of B and whose (2, j) 
entries are zero whenever j > 4. Conjugating B\ by I3 © M±j £ GL n (R) for a 
suitable M±j £ GL n -z{R), we can replace B\ by a matrix B2 in which the first 
two rows equal those of B\ and whose (3, j) entries are zero whenever j > 5. 
Proceeding inductively in this way, we obtain a matrix C = (cij ) similar to B such 
that C12 = 612 and Cjj = for i,j such that j > i + 2 and 1 < i < n — 2. But 
since B = &nl„ mod (612) we also have C = foul n mod (612), so C has the desired 
form. □ 

Using Theorem 15.61 it is now easy to prove the following result. The following 
proof is entirely analogous to that of Laffey and Reams for R = 7L. 

Proposition 5.7. Let A £ M„(i?) ; n > 3 have trace zero, and suppose that for 
every p £ Specmi? and every a £ R/p, a ^ we have Ap ^ al„. Then A is similar 
to a matrix B — (bij) £ M n (R) where bu — for all 1 < i < n. 

Proof. If Ap = for some p, we can write A — mA' , where m £ R and A' is such 
that for every p £ Specmi? and every a £ R/p we have A' p ^ al n . Since A' must 
be non-scalar Theorem 15.61 implies that A' is similar to a matrix A" = (a"j) such 
that a'{ 2 I a'lj for all i 7^ j and a'{ 2 \ (o^ — a'^) for all 1 < i, j < n. Since A" satisfies 
Ap 7^ al n for any p £ Specmi? and a £ R/p, the entry a" 2 must be a unit. Wc 
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may therefore assume without loss of generality that A = A" , so that in particular 
a\2 is a unit. 

We now prove that A is similar to a matrix with zero diagonal by induction on 
n. If n = 2, the matrix 

(1 + E 2 iana^)A(l + E^aua^)^ 1 

has zero diagonal. If n > 2, conjugating A by a matrix of the form 1 + aE n i, 
a £ fl, we may assume that a n i — 1, and then conjugating A by a matrix of the 
form 1 + PE211 P € R, we may further assume that an = 0. Thus we may assume 
that A is of the form 




where i,j 6 i?™ -1 , Ai = (a*,) G M„_i(i?) with a^_ 1;1 = 1 and tx(Ai) = 0. By 
Theorem 15.61 A\ is similar to a matrix A 2 = (afj ) such that 2 | af, for all i ^ j 
and af 2 | (a^ — a|-) for all 1 < i, j < n. Since (A2)p ^ al n for all p 6 Specmi? and 
a G R/p, the entry a\ 2 must be a unit. So by induction there exists a Q G GL„_i (i?) 
such that QAiQ^ 1 = B\ is a matrix with zeros on the diagonal. But then 

B={l 1 ®Q)A{l 1 ®Q)~ l 

has the desired form. □ 

A matrix in M„(i?) satisfying the conditions on the matrix B in Theorem 15.61 
will be said to be in Laffey- Reams form. 



6. Proof of the main result 

In this section we give a proof of our main theorem on commutators. Theorem l6.3l 
We first prove a couple of lemmas used in the proof. 

Lemma 6.1. Let R be a PID. Then the following holds: 

i) Let r G N and let cti,(3i 6 R be such that (ai,/3i) = (1) for all 1 < i < r. 
Then for every finite set S of maximal ideals of R there exists an x G R 
such that for all p G S and all 1 < i < r we have ai + fiiX ^ p. 

ii) Let a, /3 G R be such that (a, j3) = (1). Suppose that p is a maximal ideal of 
R such that \R/p\ > 3. Then for every finite set S of maximal ideals of R 
such that p ^ S there exists at G R such that t ^ p, t G q for all q £ S \ {p} 
and at + (3 ^ p. 

in) Let a,b,c G R be such that (a, 6, c) = (1), (a, 6) =/= (1) and (a,c) 7^ (1). 
Then there exists an x G R such that (a + cx, b ~ ax) = (1). 

Proof. To prove [^J suppose the opposite, that is, that for all x G R there exists 
some p G 5* and some 1 < i < r such that ai + fiiX G p. Then since R has infinitely 
many elements there exists two elements y,z G R and some p G S and i such that 
y — z ^ p, ai + (iiy G p and ai + (3iZ G p. But since |i?/p| > 2 and R/p is a field, 
this implies that ai G p and G p, which contradicts the hypothesis. 



Next, we prove ii) Since |i?/p| > 3 there exist two elements ri,r 2 G R \ p such 
that r\ — r2 ^ p. Let s G R be such that 

00= n i- 

qes\{p} 



SIMILARITY AND COMMUTATORS OF MATRICES OVER PRINCIPAL IDEAL RINGS 14 



Then for i = 1,2 we have r^s ^ p and r;s 6 q for all q 6 S \ {p}. Furthermore, 
if ariS + (3 £ p fori = 1,2, then a £ p and /? £ p, contradicting the hypothesis 
(ol,0) — (1). Thus we may assume that aris + (3 ^ p, and t = r±s yields the desired 
element. 



We now prove Hi) We first show that a + cx and b — ax are relatively prime 
as elements of R[x], that is, that none of them is a multiple of the other. Indeed, 
if a + cx = m(b — ax) for some m £ R, then a = mb and c = —ma so (1) = 
(a,b,c) = (mb, b, — m 2 b) = (b), which is impossible since (a, b) ^ (1). Similarly, 
if n(a + cx) — b — ax for some n £ R, then b = na and a = —nc so (1) = 
(a, 6, c) = (—nc, —n 2 c, c) — (c), which is impossible. Let K be the field of fractions 
of R. Since a + cx and b — ax are relatively prime as elements of R[x], they are 
relatively prime as element of 2<f[a;]. Thus there exists /o,<?o £ such that 

(a + cx)fo + (b — ax)go = 1, and so there exists some /, g 6 i?[x] such that 

(6.1) (a + cx)f + (b-ax)g = D e R\{0}. 

Let di be a generator of (a,b) and ^2 a generator of (a,c). Note that d\ ^ and 
d 2 /0 since 6^0 and c ^ 0. Write 

a + = d\(a' + ci) 

b — ax = d^(p — a"x), 

with (a',c') = (1) and (b',a") = (1). Let S be the set of maximal ideals dividing 
(d\d2D). By [7]] (with r = 2) we can choose x <E R such that for all p £ 5 we 
have a' + c'x ^ p and b' — a"x p. If p 6 i? is a prime dividing both a + cx and 
& — ax, then since (d±, d%) = (1) we must have that p divides at least one of a' + c'x 
and b' — c'x. But by our choice of x this implies that p does not divide d\d2, and 
therefore p divides both a' + c'x and b' — c'x. Hence, by (|6.1[) we get that p divides 
D, which is impossible by the choice of x. This shows that a + cx and b — ax are 
relatively prime. □ 



The special case of Lemma I6.1|^| when r = 1 will be especially useful for us and 
can also be proved by taking x to be a generator of the product 

np 

pes 

or letting x = 1 if there is no p £ S such that a\ £ p. 

The following result is the Chinese remainder theorem for centralisers of matrices 
over quotients of R. It will be used at a crucial step in our proof of Theorem [ 



Lemma 6.2. Let X £ M„(i?) and let pi, . . . ,p^ 7 v £ N be maximal ideals in R. 
Then the map 

c M„(ii/(p 1 -p„))(^(pi-p„)) — ► HCm^a/pO^p.) 

1=1 

a 1 — > (spi,---,5pJ, 

is an isomorphism. 

Proof. Let C = C , M„(i?/(p 1 ...p 1 ,))(^(p 1 ...p„)). Then C is a module over R. By the 
Chinese remainder theorem we have an isomorphism i?/(pi • • -p^) — > Yii=i R/pi 
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given byoH (a Pl , . . . , a P „), and tensoring this by C yields 

C s R/{pi ■ ■ ■ p v ) ® R C £* ( JJ (8> fl C = IJWfc ®* c ) 

i=l i=l 

8=1 

Tracking the morphisms shows that the composition of the above isomorphisms is 
given by 

g — ► 1 ®g i — ► (l Pl ,. . ., l p J ® g i — ► (l Pl ® . . . , l p „ (g) 5) i — > (g Pl ,.. .,g Pv ). 

□ 

We now give the proof of our main theorem. Note that our proof in the case 
n = 2 is different from the case n > 3, and that for n = 2, while our argument is 
not the shortest possible, yields the stronger result that any A £ M.2(R) with trace 
zero can be written as A = [X, Y] for some X, Y £ M.2(R) and X regular. 

Theorem 6.3. Let R be a PID and let A £ M„(i?) be a matrix with trace zero. 
Then A = [X,Y] for some X, Y £ M n (R). 

Proof. First assume that n — 2. By taking out a suitable factor we may assume 
that the matrix 

a b 



A = 



c —a 



satisfies (a,b,c) = (1). Let X = ( ° 1 J £ M 2 {R). By Lemma 12771 the matrix X 

yXi X2 J 

is regular so it is regular mod p for every maximal ideal p of R. Furthermore, 

ttiXA) = bx\ — 0x2 + c, 

so if (a, b) = (1) we can find x% and x 2 such that tr(XA) — 0, and Proposition 13.31 
implies that A — [X, Y] , for some Y £ M 2 (R) . Similarly, the transpose X T of X is 
also regular, and 

tr{X T A) — cx\ — ax 2 + fo, 
so if (a, c) = (1) we can find x% and x 2 such that tr(X T A) = 0, and soi= [X T , Y], 
for some Y £ M 2 (R). Hence, in case (a, b) = (1) or (a, c) = (1) we are done. Assume 
therefore that (a, b) ^ (1) and (a,c) ^ (1). If we let T = 1 + xE 1 ^ £ M 2 (R) for 
some x £ R, we have 

'a + cx 6 — ax — x(a + cx) 



TAT' 



Now a + cx and b — ax — x (a + cx) are relatively prime if and only if a + cx and 
— ax are relatively prime. By Lemma I6.l|mj| we can choose x £ R such that 
(a' + c'x, b' + a"x) = (1), and hence such that the (1, 1) and (1, 2) entries in TAT^ 1 
are relatively prime. As we have already seen, this means that we can find x% and 
x 2 such that tr(ATAT _1 ) = 0, so Proposition [O] yields A = [T^XT^] for some 
Y £ M 2 {R). 

Assume now that n > 3. If A is a scalar matrix we obviously have tr( J n (0) r A) = 
for all r > 0, so Proposition 13.31 yields the desired conclusion. We may therefore 
henceforth assume that A is non-scalar. Write A = (a^) for 1 < i,j,< n. By 
Theorem 15.61 we may assume that A is in Laffey- Reams form. If d £ R is such 
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that (a,ij,au — djj | i ^ j, 1 < i, j < n) = (d), we can write A = cL4/ where 
A' = (a'ij) G M„(i?) is in Laffey-Reams form and (a'n,a' 12 ) = (1). It thus suffices 
to assume that A = A' so that (an, 012) = (1), 612 | bij for i ^ j, 612 | (bu — bjj) 
for 1 < i,j < 3, and = for j > i + 2. Let k = [n/2\. For x,y,q G i? define the 
matrix X = G M„(i?) by 



(Xij) 



xu = -y for % = 2,4, ... ,2fc, 
X2i = a;, 

^31 = 3, 

=1 for j = 3,4, . . . ,n, 



x 



= otherwise. 



We have 



tr(XA) = xai2 + a 23 H h a„_i,„ - yc(A). 



We claim that tr(XA) = implies that tr(X r A) = for all r > 0. To see this, 
observe that the matrix X 2 + yX is lower triangular and its entry is if 

j > i — Since tr(EijA) = if j < i — 1 (since = for j > i + 2), it follows that 
tr((X 2 + yX)A) = 0, so if ti(XA) = we get tr(X 2 A) = 0. More generally, using 
the fact that X is lower triangular, we have tr((X r + yX r ~ 1 )A) = 0, and working 
inductively we get ti(X r A) — for all r > 0. 

We now show that there exist elements x,y G R with (x, y) = (1) and such that 
tr(XA) = 0. Consider the equation 

xau + a 23 H h a n -i,n = yc(A), x,y e R. 

Since ai 2 divides 023, . . . , a„_i in , this may be written 

(6.2) a 12 {x + l)=yc{A), 

for some I G R. Let d G i? be a generator of (ai2, c(A)). Then (|6.2[) is equivalent to 

x = hc^dT 1 - I 
y = hai2<r 1 , 

for any h G R. Choose h to be a generator of the product of all maximal ideals p 
of R such that a^d -1 G p and I ^ p (and let h = 1 if no such p exist). Suppose 
that (x,y) G (p) for some prime element p G R. Then y G (p) and so ai2<i -1 G (p) 
or h E (p). If a^d -1 G (p) and Z ^ (p), then Zi G (p), so x £ (p). If a^d -1 G (p) 
and Z G (p), then Zi ^ (p) and since (ai^d -1 , c(A)d~ 1 ) = (1) we have ir ^ (p). 
Furthermore, if h G (p) then Z ^ (p) so x ^ (p). Thus (x,y) = (1), so in particular 
x 2 ai2 ^ (2/012). Since ai2 divides each of an — 022, 021, 031 and 032, we have 
xy(an - 022) - J/ 2 (a2i + 2/031 + xa 32 ) G (2/012). Thus, we must have 

(6.3) x 2 ai2 + xy(a\\ - C122) - y 1 {a 2 \ + ya 3 i + xa 32 ) ^ 0. 

From now on let x and y be as above, so that (x, y) = (1) and tr{XA) — 0. Next, 
we specify the entry q in X . 

Let So be the set of maximal ideals p of R such that x 2 di2 + xy(an — 022) — 
y 2 (a 2 i + ya 3 i + xa 32 ) G p, and let 

S = S a U {p G Specmi? | \R/p\ = 2}. 
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Note that S is a finite set because of (|6.3|) together with the fact that for any PID 
R' (or any Dedekind domain) , there are only finitely many p £ Specm R' such that 
l-R'/Pl = 2. By Lemma l6.l[I)| (with r = 1) we can thus choose q £ R such that 

x + qy ^p, for all p £ S. 

Let V be the set of maximal ideals of R such that x + qy £ p, that is. 

V = {p e Specm R \ x + qy ep}. 

By the choice of q we thus have that 

(6.4) p £V => x 2 a 12 + xy(an - a 22 ) - y 2 (a 2 i + ya 3 i + xa 32 ) £ p- 

Note that for every p £ V we have y ^ p since (a;, y) = (1). Note also that SDV = 0. 

We claim that X p £ M n (R/p) is regular for every maximal ideal p not in V. To 
show this, let p £ Specm R \ V and let 

M=( x + ™ °)®l n - 2 £M n (R). 



Since x + qy $ p the image M p £ M n (R/p) of M is invertible and, letting y p denote 
the image of y in R/p, we have 

{mu = -yp for i = 2,4, ...,2k, 
mjj-i=l for j = 2,3,..., n, 
m uv = otherwise. 

It follows from Lemma [2.71 that M^XpM^ 1 is regular, and thus X p is regular. 

By our choice of q we have p ^ V if p £ S, so X p is regular for any p g S, and 
5 is non-empty. By Proposition 12.61 we have that X is regular as an element in 
M n (F), where F is the field of fractions of R. By our choice of x and y we have 
tr(X r A) = for r = 0, 1, . . . , n — 1, so Proposition 13. II implies that we can write 
A = [X, Q] , for some Q £ M„ (F) . Clearing denominators in Q we find that there 
exists a non-zero element mo £ R such that mo A £ [X, M n (R)}. We now highlight 
a step which we will refer to in the following: 

Let m £ R be such that it has the minimal number of (not necessarily distinct) 
(*) prime factors among all m' £ R such that m! A £ [X,M n (R)], and let Q £ 
M n (R) be such that mA = [X, Q]. 

We show that the only maximal ideals containing m are those in V . Suppose that 
p = (j>) £ Specm R \ V and that m £ p. Then = [Xp,Qp], and since X p is 
regular there exists a polynomial / £ R[T] such that Q = f(X) + pQ' for some 
Q' £ M n (R), so mA = [XJ{X) + pQ'} = [X,pQ'\ and thus mp^A = [X,Q'\, 
which contradicts (*). Thus, if m £ p for some p £ Specm R, then we must have 
p £ V. Let pi, p2, ...,p v , v £ N U {0} be the elements of V such that m £ pi. For 
each pi, choose a generator pi £ R, so that p^ = (pj), for i = 1, . . . , v. We then have 

m = p e iP e 2 2 ■■■pi", 

for some a £ N, 1 < i < v. 

The strategy is now to show that X can be replaced by a matrix X\ which is 
regular mod p for every p £ V. Let 

N= {q J) © !n-2 S M n (i?). 
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For ease of calculation we will consider the matrices 

A = NAN- 1 , X =NXN~ 1 , Q a = NQN^ 1 . 
Let p £ V be any of the ideals pi,p2; • • • iPu- We have 

(6-5) (*o) P =(o w) = ^® Wp ' 

where W p £ M n _i(i£/p) is regular. We wish to determine the dimension of the 
centraliser 

C(p) := C Mn ( JJ / p )((X )p). 
Since (x,y) — (1), we have y p ^ 0, so the Jordan form of {Xq) p is 

Jk(-Vp) © J»_fc_i(0) © Ji(0), 

where k = [n/2\, as before. We have C = C Mn (R/ P ) ( Jk(-yp)) © C Un ( R / p )(J k (0) © 
Ji(0)), and dimCM n (R/p)(Jk(~ Up)) = k so it suffices to determine the dimension 
of C Un (R/p)(J n -k-i(0) © ^i(O)). A matrix 

h=(hI\ £) GMn(jR/p) ' 

where Hu is a k x k block, H 22 is a 1 x 1 block, and the other blocks are of 
compatible sizes, commutes with J n -fc-i(0) © Ji(0) if and only if 

fliiJ„_,k-i(0) = ./„_*_! (Oj-Hu, H 12 £ ^ Q /P j , H 2 i £ (0,i?/p). 

Hence dimCu n (R/p)(J n -k-i(0) © Ji(0)) = n — k — 1 + 1 + 1 + 1, and so 

dimC(p) = n + 2, 

that is, (^o)p is subregular (cf. Q2]). Next, we need the dimension of (R/p)[(Xo)p\ 
(the algebra of polynomials in (X ) p over the field R/p). Since (R/p)[(X ) p ] = 
(0) © (R/p)[W p ] and Wp is regular, we have dim(R/p)[(X ) p ] = n - 1. 

We now find a basis for C(p). We know that (R/p)[(X ) p ] is an (n — 1)- 
dimensional subspace of C(p). Moreover, Bn and E±2 + ypE\ 3 are in C(p). Let 
k = n + 1 — 2|_(n + 1)/2J, that is, k is if n is odd and 1 if n is even. Then 
also E n i + ny p E n -i.i £ C(p), and since (^o)p is lower triangular and the first col- 
umn of (Ao)p is for all i G N, the intersection of (R/p)[(Xo) p ] with the i?/p-span 
(Eu, E 12 + y p Ei 3l E nl + ny p E n -i tl ) is 0. Since {£n, Ei 2 + y p E 13 , E nl + ny p E n _i A } 
is linearly independent, dim(i?/p)[(Xo) p ] = n — 1 and dimC(p) = n + 2, we must 
have 

(6.6) C{p) = ({R/p)[{X ) p ],E lu E 12 +y p E 13 ,E nl +Ky p E n ^ hl ). 

We observe that the matrix E n \ + KyE n -\^\ £ M n (R), whose image in M n (i?/p) is 
E n i + KJ/p^„-i,i, satisfies E n \ + nyE n - lt i £ C Mn ( R ){X ). 

Let a = nr=i P«> so ^ na ^ a — (Pi ' ' 'Pv)- By (16. 6[) and Lemma [6721 we have 

(6.7) C Mn (fl/„)(X a ) = ((fl/o)[(X )„] s £;ii,Bi2+tf«Sl3,Snl+KtfB^n-l,l)- 

Since [X ,Q ] = mA we have ([X ,Q ]) a = 0, that is, (Q ) a £ C Mn ( R/a) ((X ) a ). 
Hence, by (|6~T1) 

Qo = / (Jf ) + oEn + £(£12 + yE 13 ) + j(E nl + nyE n _ hl ) + Pl ■ --p^D, 
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for some a,/3,-y G R, f(T) G R[T] and D G M n (R). Using that £„i + nyE n - 1:1 G 
Cm„(a)(^o), we get 

(6.8) [X , Q ] = [X , f(X ) + aEn + /3{E 12 + yE 13 ) 

+ y(E nl + KyE n ^i t i) +pi ■ ■ -p v D\ 
= [X , aEu + /3(E 12 + yE 13 ) + p x ■ ■ ■ p v D\ 

= [X 0j Qi], 

where Q\ = aEu + fi(E\ 2 + yEi 3 ) + pi ■ ■ -p v D. 

Let i be such that 1 < i < v. If (a,f3) C pi then [X Q ,Qi] G p,[X , M n (i?)] and 
so mp^ 1 A G [X,M n (i?)], contradicting (*). Thus either a £ pi or j3 £ p t . We show 
that the case where a G pi and j3 ^ pi cannot arise. Since to^4o = PQbQiL we 
have mtr(QiAo) = 0, whence tr(QiAo) = 0. This together with a G pi and (3 £ pi 
implies that 

tr((£i2 +yE 13 )A ) G ft. 
Recalling that A = NAN^ 1 we thus get 

-q 2 a 12 + q(au - a 22 ) + a 21 + ya 31 - qya 32 G pi 
and, after multiplying by y 2 , 

-q 2 y 2 ai 2 + qy 2 (an - a 22 ) + y 2 {a 21 + ya 3 i - qya 32 ) G p,;. 
Since pi G V we have qy G —x + pi and so 

x 2 a 12 + xy(au - a 22 ) - y 2 {a 21 + ya 31 + xa 32 ) G pi. 
But by our choice of q we have 

x 2 a 12 + xy(au - 0-22) - y 2 {a 2 i + ya 3 i + xa 32 ) p, 

for all p G V, which is a contradiction. Therefore we cannot have a G pi and 
/3 ^ pi, so we must have a ^ pi. We have thus shown that for all i = 1, . . . , v we 
have a ^ pi. 

By Lemma l6.l|Mj| and our choice of S there exists a t G R such that for all 
i = l,...,v 

(6.9) t (£ pi, and at + y p L . 
Define the matrix 

X\ = Xq + tQ\. 

Let p be any of the ideals pi,p2, ■ • • ,pv Let a p ,/3 p ,t p denote the images of a, /3 
and t in R/p. respectively. As before, let y p denote the image of y in R/p. Note 
that if we let 

/l Ppa^ 1 ypPpap 1 ^ 
L p = 1 © 1„_ 3 G M n (R/p), 

Vo 1 J 

then L p {X\) p L p l = a p t p En © Wp, where W p is the matrix in (|6.5jl . Since 
is regular and neither of its eigenvalues or ~y p equals a p t p by (|6.9I1 . the matrix 
a p t p Eu (B W p , and hence (Xi) p G M„(i?/p), is regular. We thus see that (Xi) p is 
regular for all of the ideals pi,p2, • • • , pv 
By (16. 8p we have 

mA a = [X ,Q ] = [X ,Qi] = [X u Qt], 
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and since (Xi) Pi is regular and m 6 pi we get Q\ = gi(X\) + PiQ[, for some 
g t {T) G R[T] and Q[ € M„(i?). Thus 

mAo = +j>iQi] = pi[Xi,Qi], 

and so mp~ 1 Ao = [Xi.Q^]. Repeating the argument if necessary, we obtain Aq G 
mp~ ei [Xi,M n (R)]. Running through each i = 1, . . . , v we obtain A§ = [_X"i, F] for 
some Y E M n {R), and hence A = [N^X^, NYN^ 1 ]. □ 

By a theorem of Hungerford [5] every principal ideal ring (PIR) is a finite product 
of rings, each of which is a homomorphic image of a PID. Together with Theorem l6.3l 
this immediately implies the following: 

Corollary 6.4. Let R be a PIR (not necessarily an integral domain) and let A G 
M n (R), n > 2, be a matrix with trace zero. Then A — [X, Y] for some X, Y G 
M n (R). 

7. Further directions 

If R is a field or if R is a PID and n — 2, we have shown that every A G M n (R) 
with trace zero can be written A — [X,Y] where X, Y G M n (R) and X is regular. 
Our proof of Theorem 16. 31 shows that for any PID R, n > 2 and every A G M n (R) 
with trace zero we have A = [X, Y] for some X, Y G M„ (R) where X p is regular for 
all but finitely many maximal ideals p of R. Is it always possible to choose X such 
that Xp is regular for all maximal ideals p? An even stronger question is whether 
it is always possible to choose X regular. 

It is natural to ask for generalisations of Theorem 16.31 to rings other than PIRs. 
We first mention some counter-examples. It was shown by Lissner [8] that the 
analogue of Theorem 16.31 fails when n — 2 and R — k[x,y,z], where fc is a field, 
and more generally that for R = k[x\, . . . ,X2n-i] there exist matrices in M„(i?) 
with trace zero which are not commutators (see [H Theorem 5.4]). Rosset and 
Rosset P21 Lemma 1.1] gave a sufficient criterion for a 2 x 2 trace zero matrix 
over any commutative ring not to be a commutator. They showed however, that 
a Noetherian integral domain cannot satisfy their criterion unless it has dimension 
at least 3. This means that their criterion is not an obstruction to a 2 x 2 trace 
zero matrix over a one or two-dimensional Noetherian domain being a commutator. 
Still, if R is the two-dimensional domain R[x, y, z\j (x 2 + y 2 + z 2 — 1) it can be shown 
that there exists a matrix in M2 (R) with trace zero which is not a commutator (this 
example goes back to Kaplansky; see |17[ Section 4, Example 1], [9, p. 532] or [T3l 
Section 3]). 

A ring R is called an OP-ring if for every n > 1 every vector in f\ n l R n is 
decomposable, that is, of the form v\ A- • • At»„_i for some vi G R n . This is equivalent 
to saying that every vector in R n is an outer product (hence the acronym OP). The 
notion of OP-ring was introduced in [9]. In particular, for n — 3 the condition on 
R of being an OP-ring is equivalent to the condition that every trace zero matrix 
in M.2(R) is a commutator (see (11 Section 3]). It is known that every Dedekind 
domain is an OP-ring [51 p. 534] and more generally that every polynomial ring 
in one variable over a Dedekind domain is an OP-ring [TS1 Theorem 1.2]. This 
prompts the following problem: 

Problem. Let R be a Dedekind domain and assume that A G M n (R), n > 2, has 
trace zero. Is it true that A = [X, Y] for some 1,7 £ M„(i?)? 
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Since Dedekind domains are OP-rings the question has an affirmative answer for 
n = 2, and one could ask the same question for any OP-ring. In the setting of 
matrices over a Dedekind domain the methods we have used to prove Theorem 16.31 
are of little use because they rely crucially on the underlying ring being both atomic 
and Bezout, which implies that it is a PID. 
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